07:24
2026-07-01
machinebrief.com
large-language-models
Revealing Backdoors in LLMs: New Detection Framework Emerges
Researchers have developed a new framework for detecting backdoor attacks in large language models, addressing the challenge of discrete input spaces. The framework introduces Class Subspace Orthogona…